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ABSTRACT 



In this paper we estimate the Large Synoptic Survey Telescope (LSST) yield 
of eclipsing binary stars, which will survey ~20,000 square degrees of the south- 
ern sky during the period of 10 years in 6 photometric passbands to r ~ 24.5. 
We generate a set of 10,000 eclipsing binary light curves sampled to the LSST 
time cadence across the whole sky, with added noise as a function of apparent 
magnitude. This set is passed to the Analysis of Variance (AoV) period finder 
to assess the recoverability rate for the periods, and the successfully phased light 
curves are passed to the artificial intelligence-based pipeline ebai to assess the 
recoverability rate in terms of the eclipsing binaries' physical and geometric pa- 
rameters. We find that, out of ~24 million eclipsing binaries observed by LSST 
with S/N > 10 in mission life-time, ~28% or 6.7 million can be fully character- 
ized by the pipeline. Of those, ~25% or 1.7 million will be double-lined binaries, 
a true treasure trove for stellar astrophysics. 

Subject headings: Methods: data analysis, numerical, observational; Binaries: 
eclipsing, fundamental parameters; Techniques: photometric; Surveys: LSST 
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Introduction 



The importance of eclipsing binary stars (EBs) can hardly be overstated. Their 
analysis provides calibration-fr ee physical proper ties of stars (i.e. masses, radii, surface 



temp eratures, lumi nosities; cf. 



998) ; stellar ages (Thompson et al. 



( iStassun et al. 



Torres et al. 



2010l ); precise stellar distances (IGuinan et al. 



2010[ ); and stringent tests of stellar evolution models 
2009J). The products of state-of-the-art EB modeling are seminal to many 
areas of astrophysics, including: calibrating the cosmic distance scale; mapping clusters 
and other stellar populations (e.g. star-forming regions, streams, tidal tails, etc) in the 
Milky Way; determining initial mass functions and studying stellar population theory; 
understanding stellar energy transfer mechanisms (including activity) as a function of 
temperature, metallicity and evolutionary stage; calibrating stellar color-temperature 
transformations, mass-radius-luminosity relationships, and other relations basic to a broad 
array of stellar astrophysics; studying stellar dynamics, tidal interactions, mass transfer, 
accretion, chromospheric activity, etc. 

The upcoming decade promises to yield astronomical data that are unprecedented 
both in quantity as well as in ac curacy. The Pano ramic Survey Telescope & Rapid 



Response System (Pan-S 



(Uedicke k Pan-STARRS 



ARBS: 



Kaiser et al. 



20021 ) is conducting their first pilot study 



20071 . PS1) in which a quarter of all Pan-STARRS targets are 
observed in an effort to test the telescope, camera and processing software. Although 
the main objective of the upcoming PS2 stage of Pan-STARRS is to detect potentially 
hazardous objects in the Solar system, the by-product will be a 6-band ph otometr ic 



database of ~20 billion stars, each observed 20-60 times, visible from Hawaii ((Kaiser 



2005h . 



It is scheduled to see first 



operation duration. Gaia (ILindegren et al- 



ight in the Fall of 20 12. The survey has no restrictions on 



20081 ) is the next ESA cornerstone mission 



scheduled to launch in 2012; it will observe ~1 billion stars, each about 70 times during its 
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5-year mission. While Gaia is primarily an astrometric mission, the observations will be 
complemented by low-resolution spectrophotometric data in the 330-1000 nm wavelength 
range, the G passband photometry from the unfiltered light in th e astrometric field and, 



for the stars brighter than 17-mag, radial velocity measurem ents rtJordi et a 



expected yield of eclipsing binary stars has been studied by 



Munari et al. 



20 



fl). The 



fbOQlh who 



estimate that Gaia w ill observe ~100,000 double-lined eclipsing binaries. Already, Kepler 



(IBorucki et al. 



20101 ) has detect ed and character ized 1879 eclipsing and ellipsoidal binaries 



in a sample of ~ 156, 000 stars (IPrsa et al. 



201 lh . 



The Large Synoptic Survey Telescope (llvezic et al. 



2008 



LSST) is a large, wide-field 



system designed to obtain multiple images covering the southern sky. LSST is scheduled 
to see first light in mid-2018. The current baseline design, with an effective 6.5 m primary 
mirror, a 9.6 deg 2 field of view, and a 3.2 Gigapixel camera, will allow about 20,000 square 
degrees of sky to be covered using pairs of 15-second exposures in two out of six SDSS 
(ugrizy) photometric ba nds every three night s on average, with typical 5a depth for point 
sources of r ~ 24.5. The ILSST Science Book! (§6.10, pg. 171) includes a discussion on the 
importance of EBs to the LSST science case, and includes a preliminary estimate of the 
EB yield. It proposes a metric based on the number of visits (pairs of 15-s observations) in 
the eclipses and provides qualitative results that indicate LSST's near-perfect efficiency for 
short-period binaries, ~50% efficiency for periods up to 30 days, and a ~10% efficiency for 
the longer periods. Here we revise the estimate in significantly greater detail and provide 
quantitative results. 

The paper structure is as follows: in §2 we present the details of the simulation, in §3 
we discuss the recoverability rate for the ephemerides and for physical parameters; in §4 we 
summarize the results and conclude in 55. 
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2. Model LSST Light Curves and Analysis 



In this section we describe our procedures to estimate L SST's EB yield. First we 



created a set of model EB light curves employing PHOEBE (jPrsa & Zwitter 



20051 ). a 



Wilson fc Devinneyl ( I197ll ) based eclipsing binary modeling su ite. Next, we sam pled these 



20091) over a 



light curves according to LSST's so-called universal cadence (jCook et al. 
nominal 10-year life span of the mission. We then added noise as a function of apparent 
magnitude, where the distribution of magnitudes was drawn from the Sloan Digital Sky 
Survey (SDSS) luminosity function. The light curves were then passed to a standard period 
finder to determine the ephemerides, and phased light curves were processed by our neural 
network-based engine, ebai, to automatically estimate principal parameters of the systems. 
Finally, we compared these estimated parameters with the ones actually used to create 
the sample, which provided us with the LSST efficiency metric for the EB yield. We now 
describe these steps in turn. 



2.1. Input Light Curve Parameters 

We synthesized 10,000 detached eclipsing binary light curves, described by their 
ephemerides (HJD and period Pq) and 5 principal parameters: T 2 /Ti, pi + p 2 , esino;, 
ecosw and sinz. The temperature ratio T 2 jT\ serves as a proxy to the surface brightness 
ratio B 2 /Bi that directly determines the ratio of depths of both eclipses. The sum of 
fractional radii pi + p 2 = (Ri + R 2 )/a determines the baseline width of the eclipses. The 
radial eccentricity projection esinw, where u is the argument of periastron, determines 
the ratio between the secondary and the primary eclipse widths. The phase separation 
of the eclipses is proportional to the tangential eccentricity projection ecosu, and the 
overall amplitude of the light curve, as well as the shape of eclipses (U- or V-shaped), 
are determined by sini. The limb darkening parameters were automatically interpolated 
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from Prsa et al. (2011; in prep.) based on component temperature and effective gravity, 
and gravity darkening exponents to 1.0 and 0.32 for radiative and convective envelopes, 



respective 



Prsa et al. 



y . For a thorough discussion on the choice of principal parameters refer to 



fl2008h . 



The values of principal parameters were randomly sampled from the following 
probability distribution functions (PDFs; cf. Fig. [T]): 

• T 2 /Ti is sampled from a normal distribution £7(1.0,0.18); 

• Pq is sampled from a log-uniform distribution [—1,4]; 

• HJDo is sampled from a uniform distribution [0, Po]; 

• Pi + P2 is sampled from a uniform distribution [0.05,<5 max — 0.05], where <5 max is the 
morphology constraint parameter that depends exponentially on the value of logP: 

(W(logPo) = 0.7 exp I — 

• eccentricity e is sampled from an exponential distribution £(0.0, e max ), where e max is 
the attenuation parameter that depends exponentially on the value of p% + p<i- 

( pi + p 2 -0.05\ 
e m ax(pi +P2) = 0.35 exp I — \ ; 

• argument of periastron u is sampled from a uniform distribution [0, 27r]; 

• sine of the inclination sini is sampled from a uniform distribution [sini grazing , 1], where 
grazing is the inclination of a grazing eclipse: 



sini grazing = >/l - (pi + p 2 ) 2 . 
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Fig. 1. — Theoretical probability distribution functions for T 2 /Ti, pi + p 2 , e, w, and i. 
Principal parameters of the training set and of the test set for 10,000 EBs were sampled 
from these distributions. 
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The reason why we did not simply use uniform distributions in all parameters and thus 
cover the whole parameter space is to avoid unphysical light curves. Such light curves, when 
they comprise a significant fraction of the whole sample, have a notable adverse impact on 
the netwo rk's recognitio n ability, especially for second-order parameters such as esino; and 



ecoso; ( iPrsa et al. 



20081). 



2.2. Input Light Curve Cadence 

LSST will survey the sky according to the so-called universal cadence. This scanning 
model is optimized for the uniform observed depth of r ~ 24.5 and a unifor m number of 



visits across 20,000 square degrees of the southern sky ( iLSST Science Bookl . §2.1). A visit 
is defined as a pair of 15 s exposures, performed back-to-back, with a 4 s readout separation, 
to aid in cosmic ray rejection. Most fields will be observed twice during the nightly run, 
with visits separated by 15-60 minutes. This is done to maximize the sensitivity to motion 
of solar system objects but also to benefit the detection of short period stellar variability. 
The universal cadence excludes visits of ~1,000 square degrees around the galactic center 
because of crowding. 

To simulate this universal cadence, we made use of the Operations Simulator opsim 
1 . 29 that serves to evaluate the suitability of t he scanning model an d to quantify the 



yield for individual scientific goals of the survey ( ILSST Science Bookl §3.1). The simulator 



provides an array of heliocentric Julian dates (HJD) at which a given field will be observed. 
We split the southern sky uniformly in declination: 



Si = x 100° - 90° for i = 1 . . . N, 

where N is the number of declination bands. The z-th band is then split into M, right 
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ascensions: 



Mi = l + int (1.3 x 2N sin Si); 
J ~ 0.5 



Mi 



x 360° for j = 1 . . . M { . 



The factor 1.3 in the expression for Mi makes the distribution in a 30% denser to 
account for the pronounced universal cadence variability along a. Sky partitioning in this 
way yielded 1558 fields for N = 30, covering all right ascensions and declinations between 
—90° and 10°. Fig. [2] depicts the number of visits per field in the r band. The numbers 
typically vary from almost 600 points per lightcurve close to the celestial equator, down to 
300 points per lightcurve at higher declinations. Fig. |3] shows several examples of the LSST 
phase coverage on EB light curves with different orbital periods. 



2.3. Luminosity Function and Noise Model 



After the application of the LSST cadence to the model lightcurves, we then apply 
a combination of apparent luminosity function and noise model to obtain data similar 
to what we expect to emerge from the LSST data pipeline. We first derive a luminosity 
functio n from the SPSS, takin g a random selection of stars from the SDSS Data Release 7 



(DR7) ( lAbazajian et al 



20091 ). We randomly selected 18,000 stars at all magnitudes and 
sky positions from the DR7 footprint. We then fit a simple polynomial to the distribution 
of r-band magnitudes to model the luminosity function of field stars, and randomly assign 
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LSST number of field visits in the r band 
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Fig. 2. — The number of LSST r-band visits (pairs of 15-s exposures) across the sky, accord- 
ing to the opsiml . 29 universal cadence. The operations simulator has a sophisticated model 
of the telescope and dome and outputs the cadences (heliocentric Julian dates) per passband 
as a function of right ascension and declination of the field. The nominal sky coverage is 
20,000 square degrees across the southern hemisphere, and averages to ~230 visits per field. 
The dashed line depicts the galactic equator. 
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Fig. 3. — Nine eclipsing binary light curves depicting the LSST cadence. The dashed line 
is a theoretical curve with no noise or period uncertainty added. The points are individual 
visits in the r band. The panels are arranged from the shortest (top left) to the longest 
period (bottom right): 1.42-d, 1.80-d, 2.06-d, 2.53-d, 6.31-d, 6.72-d, 15.2-d, 57.9-d, and 109- 
d. The phase coverage becomes a problem towards longer periods: there is a single data 
point acquired during the primary eclipse of the 57.9-d binary, and no data points during 
the secondary eclipse of the 109-d binary. For detached EBs it is the number of data points 
in eclipse that predominantly govern period and parameter determinability. 
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Table 1: Theoretical LSST Noise Model (Howell 2009, priv. comm.) 



r magnitude 


Expected per-point 




RMS error (magnitude) a 


16.0000 


0.0011 


16.5000 


0.0014 


17.0000 


0.0018 


17.5000 


0.0023 


18.0000 


0.0030 


18.5000 


0.0039 


19.0000 


0.0051 


19.5000 


0.0070 


20.0000 


0.0097 


20.5000 


0.0138 


21.0000 


0.0202 


21.5000 


0.0303 


22.0000 


0.0461 



a Based on an exposure time of 15s. 



apparent magnitudes to our sample lightcurves with a distribution based on the SDSS 
luminosity function in the range 16 < r < 22. 

We then apply Gaussian white noise to the lightcurves. To match the assigned 
magnitude to an expected photometric error, we use a theoretical LSST noise model (S. 
Howell 2009, private communication). That noise model is shown in Tabled! 
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3. Recovery of EB Parameters from LSST Lightcurves 

With a set of EB lightcurves representative of what LSST will observe, our next step is 
to determine how well we can recover the physical parameters of the EBs. We attempt this 
recovery with two separate tests. First, we apply an off-the-shelf period- finding algorithm 
to obtain the correct period and phase the lightcurves. Second, we take the lightcurves 
for which the true period was recovered within a specified tolerance, and use a neural 
network-based method to attempt to recover the physical parameters of the system. Note 
that we do not attempt to simulate the performance of light curve classi fiers; rather, we 
assum e that automated light curve classification is done successfully (see 



Debosscher et al. 



20071 for general efficiency estimates). We plan to explore the classification success in detail 



in the follow-up paper. 



3.1. Period Recovery 



Successful determination of the orbital period depends on many factors, and a variety 
of diff erent methods have been utili zed to identify their periods, such as analysis of varian ce 



(AoV; 



see 



S ch warzenb erg- Czernv 



Devoi 



2005 



Hartman et al 



1989) o r box- fitting least-square (BLS; 



Kovacs et al. 



2008 



20021 ) 



20091 for examples). For the purposes of this analysis, 
we use the AoV method only. We acknowledge that other methods will likely be able 
to improve upon the detection efficiencies we find here, especially in different regions of 
parameter space. We therefore present the following results as a lower limit in terms of 
period recovery fraction. 

)o1eB utilft 



In our implementation, we utilize the vartoolqj utility (lHartman et al. 



2009J). We 



use the AoV algorithm to search for periodicities in all 10,000 lightcurves, with a period 



1 http:/ /www. cfa.harvard.edu/ j hartman/ vartools/ 
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search range of 0.5 to 1000 days, and subsample and finetune parameters of 0.1 and 0.01, 
respectively. We record the output periodigram of each lightcurve. 

The criterion we use to determine a "successful" period recovery is the relation 
k = 6t/P 2 (Oluseyi 2010, private communication). This formula states that for any points 
in a phase-folded lightcurve, the maximum offset from their correct phase k, expressed as 
a fraction of the true period, is the fractional offset between the measured period P m and 
the true period P, times the number of cycles in the lightcurve. The fractional offset is 
/ = S/P, where 5 = \P m — P\, and the number of cycles is the lightcurve duration r divided 
by the true period P. The parameter k reflects the maximum offset within the phased 
lightcurve of the most discrepant points. By testing various values for k, we find that a 
requirement of k < 10% returns the largest number of correctly recovered periods. We thus 
require that k < 10%, so 5 < 0.1P 2 /t. To determine whether an EB is recovered via the 
AoV method, we consider the period of the highest peak in the periodigram (P&). Since EB 
lightcurves typically show more power at half the true period than the true period itself, we 
use twice the peak period as the matching value. Thus, we define P m = IP^. 

We find that successful period recoverability is strongest for short period EBs and 
declines relatively evenly in log P. This is expected since phase coverage for short 
period EBs is more complete than for long period EBs (cf. Fig. [3]). The results from 
the AoV analysis for different EB types are shown in Figure HI In this plot, we split up 
the input lightcurve sample according to the physical parameters of the EBs, based on 
the parameter combination (pi + p 2 )P 2 ^ 3 , which is roughly the sum of the radii of the 
components, ~ + R 2 ). We separate the sample into Dwarfs + R 2 < 5R Q ) and 
Giants + R 2 > 5R Q ). The dashed lines show the distribution of input systems and the 
solid lines show the recoverability, as a function of period. 

Any attempt at period recovery for ground-based data typically encounters aliasing 
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Fig. 4. — The period recovery rate of eclipsing binary stars with the AoV technique, based 
on the luminosity class of the components and orbital period. The period recovery criterion 
is k < 10%. See §3.11 for further details of the recovery criteria. 
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effects at integral fractions and multiples of 1 day. For systems where the true period is 
very close to those periods, recovery tends to decrease strongly. In FigHJ the resolution in 
period space is too large to see significant aliasing effects. While such effects are inherently 
present in LSST data, they are actually marginal due to the LSST cadence. Because of the 
combination of infrequent sampling (1-3 visits/night every ~20 nights) and the very long 
time baseline (10 years), aliasing effects at the diurnal cycle are fairly small compared to 
higher-cadence and/or single-season campaigns. 

It may seem surprising that we are able to accurately recover EB periods as long 
as ~1000 d. We find that in many of these cases, particularly where one of the stellar 
components is a giant, it is the ellipsoidal variations that provide the strong observational 
handle on the system's period. In an EB where one component is a giant star, the giant can 
fill a large fraction of the semi-major axis even though the orbital period may be quite long. 
The proximity of the companion star can then raise tidal bulges on the giant star that lead 
to small but observable light curve variations on the orbital period. Mo re quantitatively, th e 



expec ted peak-to-peak ellipsoidal variation is ~ 2M2/ M\{R\ sini/a) 3 ( Ivan Kerkwijk et al. 



20101 ). Figure [5] shows three example light curves from our simulations that illustrate the 
above effects. In each panel, we list the semi-major axis (in solar radius units), the relative 
size of the primary star (Rl/a), and the level of expected peak-to-peak variations. For 
example, the middle panel illustrates the case of an EB with a = 325.4 R Q (P Q rb = 334 d) 
and R\ = 55.3 R Q . The expected level of ellipsoidal variation is ~1%, and this is indeed 
what we observe in the light curve. For the minimum photometric signal-to- noise adopted 
in our simulations, variability amplitudes of ~1% or larger should yield easily recovered 
periods by LSST. 

We note that the relative proportion of giants in our test lightcurves (20%) is slightly 
higher than the proportion in the Galaxy (5% - 15%, see Girardi et al. 2011, in prep). That 
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difference means that the actual recoverability of long-period binaries of all types would be 
somewhat lower than the bottom panel in Figure |U 



3.2. Accuracy of Eclipsing Binary Parameter Determination 

Neural networks will perform exceptionally well for light curves whose parameters are 
within the boundaries of the training set (especially in terms of non-linear interpolation), 
but the performance will deteriorate exponentially when extrapolation is required 



([Freeman fc Skapura 



19911 ). To assure optimal operation, we trained the network on as 



wide a range of physically plausible light curves as possible (viz. Fig. [p. 

The EBs with successfully recovered periods were passed to the artificial int elligence 



based engine ebai (Eclipsing Binaries via Artificial Intelligence; 



Prsaetal 



20081 ) to derive 



physical parameters of the systems: T 2 /Ti, p\ + esinw, ecosw, and smi. We first use 



polyfit (jPrsa et al 



20081 ) to find an analytical approximation for each light curve. This is 



necessary because the neural network takes flux values in equidistant phase steps as input. 
The polyfit algorithm divides the phased light curve into N intervals and fits an n-th 
order polynomial to the data in each interval; it then connects the polynomials into a chain 
by imposing two conditions: connectedness at chain knots and smooth chain wrapping 
on phase space boundaries. To find the best fit, polyfit perturbs the position of chain 
knots; any knot positioning that improves the value of x 2 is adopted. This way a smooth 
polynomial chain with N discontinuities in the first derivative represents EB data optimally. 
For the purpose of this simulation, second order polynomials were fit to 4 segments (i.e. 2 
segments in eclipse and 2 segments between eclipses). The polyfits were then passed to 
ebai. 



Backpropagation network training, the only computationally intensive part of ebai, 
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needs to be performed only once for a given passband; this was done for the Sloan r band 
on a 24-node Beowulf cluster using a parallelized version of the code - the task took ~2 
days for 1 million iterations. Once trained, the network is able to process thousands of light 
curves in a fraction of a second on most computers. In particular, 10,000 light curves used 
in this simulation were processed in 0.5 s on a 2.0GHz laptop, where most of this time was 
spent on I/O operations. 

The initial test on a set of light curves with no noise or period uncertainty returned 
parameters with less than 10% error for 75% of all stars (ILSST Science Bookl . §6.10, 
pg. 176). The limiting factor is the limited number of visits that decreases the recoverability 
rate for long period EBs (due to a small number of data points in eclipses; cf. Fig. [3]). 

The situation with added noise and period uncertainty becomes significantly worse. 
Fig. [6] depicts the results of ebai for the 10,000 simulated light curves used in this paper: 
~10% of all parameters deviate from their true values by less than 2.5%; ~10% by less than 
5%, and ~28% by less than 10%. A 10% error might seem large (typical error estimates 
of state-of-the-art EB modeling are close to 1-2%), however ebai serves to provide an 
initial estimate for parameter values that would subsequently be improved by model-based 
methods such as Differential Corrections or Nelder & Mead's Simplex, as implemented in 
PHOEBE. The simulation indicates that this will be readily possible for about a quarter of 
the sample. 



3.3. Overall Recovery 



Combining the AoV period recovery and the parameter recovery provides the overall 
LSST yield. To set a threshold for "correct" parameter recovery, we compare the recovered 
value versus the true value for all 5 lightcurve parameters, and compute the percentage 
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differences for each parameter. We consider those systems for which the average of the five 
parameter percentage differences is below 10% to be recovered. The results are shown in 
Figure [7J As stated above in 13.11 the true recoverability fraction at longer periods would be 
lower than shown in [7J, due to the smaller proportion of giants compared to the fraction in 
our test lightcurves. Recoverability decrease towards longer periods is predominantly due 
to incorrectly determined periods; the suppression in recoverability due to failed parameter 
estimation does not significantly depend on the period of the binary. 



4. Conclusion 

The Large Synoptic Survey Telescope has the potential to dramatically impact the 
current state of stellar astrophysics. As the simulations described above demonstrate, LSST 
will detect a majority of the EBs with orbital periods less than 1 day, and substantial 
fractions of those with periods up to ~100 days. This completeness estimate is based on 
analysis of a single passband; simultaneous analysis of all six LSST bands will in reality 
improve this completeness. With a nominal detection limit of r = 24.5, a magnitude of 
r = 22.0 should allow detection of targets with a S/N of 10, r = 19.5 will have S/N of 100. 
Table [2] shows the distance out to which certain fiducial EB types can be detected. For 
example, a pair of eclipsing M2 dwarfs will be detected out to 1 kpc with S/N of 100. 



We can estimate the number of EBs that LSST will be able to fully characterize. 
Experience modeling EB light curves shows that S/N ~ 10 per data point typically suffices 
for the determination of physical and geometric parameters t o a few percent. Gaia will 



observe ~1 billion stars down to r ~ 20.5 over the whole sky (ILindegren et al 



20081 ). We 



can expect that LSST will observe ~0.5 billion stars to this same depth in the southern 
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Table 2: Distance Limits for LSST Detection of Sample EBs. 



Sample B 


nary" Type 


Binary Absolute Magnitude 


Distance 6 for r = 22.0 [kpc] 


Distance 6 for r = 19.5 [kpc] 


M5V 


+ M5V 


12.9 


0.7 


0.2 


M2V 


+ M2V 


9.0 


1.0 


1.3 


KOV 


+ KOV 


5.0 


25.1 


7.9 


G2V 


+ MxV 


4.6 


30.2 


9.5 


G5III 


+ GxV 


2.9 


66.1 


20.1 



"Scientifically interesting EB systems. EBs with M-dwarf components are rare in the literature. Their discovery will 
permit detailed testing of stellar models in this important mass regime. G-dwarf/M-dwarf pairs will be particularly 
valuable for pinning down the properties of M-dwarfs, since the temperature scale of G-dwarfs is relatively well 
established. A particularly exciting prospect are Cepheids (G giants) in EB systems. 
6 Assuming no extinction. 



hemisp here; scaling this to r ~ 22.0 yields ~2 billion stars. Extrapolating the results from 



Kepler (jPrsa et al. 



2011( 1 that observed 1879 EBs in the sample of 156,000 stars (1.2%), 
the LSST sample will contain ~24 million EBs with S/N > 10. The average detection rate 
(see section 13.31) for EBs over all periods will be around 28%, bringing the total number 
to ~6.7 million EBs. Roughly 25% of those will have components of similar luminosities 
(double-lined systems), yielding ~1.7 million EBs with S/N > 10 for ready detailed 
modeling. 

The 28% success rate seems low, but the following needs to be taken into consideration: 
1) the recovery rate is better than 50% for short period systems (P < 1-d) and can be 
improved to better than 60% by optimizing the period finder; 2) the cases where the period 
finder and/or the neural network fail may be identified by simply evaluating the \ 2 f° r 
each light curve, so that statistical results can be properly weighted. Thus, ~7 million EBs 
with parameter uncertainties less than 10% may be readily identified; and 3) while for the 
longer period EBs the inherent cadence limitation prevents recoverability, shorter period 
EBs will be passed to a physical model for differential corrections. These can improve the 
recoverability substantially whenever the initial ebai parameter estimates are close (within 
10%) of the optimal solution. 
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While LSST will revolutionize the numbers of detected and characterized EBs, our 
simulations predict a strong selection effect w.r.t. the orbital period. With only several 
hundred data points in the course of 10 year mission span, the probability of sampling a long 
period binary during eclipses becomes notably small and suppresses detection, classification 
and characterization success to under 10%. Another identified deficiency of the sample 
due to a limited cadence is the lack of ability to characterize any intrinsic variability such 
as spots, accretion disks and other transient phenomena. Thus, we foresee that the main 
LSST legacy will be to make a census and statistical analysis of millions of galactic and 
extragalactic EBs, but the burden of careful object-by-object studies based on the prime 
astrophysical importance will fall on the follow-up program. 
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Fig. 5.— Simulated noiseless light curves for three EBs with evolved (i.e., red giant) 
primary stars that present ellipsoidal variations despite having large semi-major axes (i.e., 
long orbital periods). In each panel is listed the EB semi- major axis (in solar radius units), 
the relative radius of the primary star (Ri/a), and the predicted peak-to-peak variations due 
to ellipsoidal effects. Horizontal dotted lines show the predicted ellipsoidal amplitude also. 
The vertical scale has been set to emphasize the low-level ellipsoidal variation; some data 
points in eclipse are lost below the figure in each case. 
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Fig. 6. — Left: comparison between input and ebai-computed values of parameters. For 
neural network optimization purposes, parameter values were rescaled to the [0.1, 0.9] inter- 
val. Successive parameters are vertically offset by 1.0 and correlation guidelines are provided 
to facilitate comparison. Right: histogram of the percent residuals computed by ebai. The 
inset depicts a cumulative distribution of the residuals. The recovery rate is ~10% to abso- 
lute errors better than 2.5%, and ~28% to absolute errors better tha n 10%. In compa rison, 



a typical recovery rate for OGLE sta rs is ~80% to a 10% precision (jPrsa et al 



^90% to a 10% precision for Kepler (jPrsa et al. 



20081 ). and 



201lh . The dominant reason for the lower 



recoverability rates for LSST when compared to OGLE and Kepler is a significantly smaller 
number of field visits and thus a poorer phase coverage - cf. Fig. [2j 



-24- 



T 



T 



~i — i — i i i i 



H 1 — III 



Periods recovered for k < 10% 
Periods recovered precisely, 

params recovered at 10% threshold 
Periods recovered at k < 10%, 

params recovered at 10% threshold 



^ 
0.1 




J I I I I I I I I I I I I I I I I I I I 



10 

Period (days) 



100 



1000 



Fig. 7. — The overall yield of eclipsing binary stars for LSST. Periods are successfully 
recovered when the most discrepant points in the lightcurve are off by less than 10% in 
phase (k < 10%, see §3.ip . For parameter recovery the average of the percent difference 
between the true and recovered values for each of the five EB parameters is below 10%. 
The solid curve is the same as in the bottom panel of Fig. H] and is reproduced here for 
convenience. It is evident that the bulk of suppressed recoverability comes from the failed 
period estimations. 
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